Language Technologies in Humanities: Computational Semantic Analysis in Folkloristics
Abstract
The paper discusses computational methods for natural language processing (NLP) and the possibilities they offer to folkloristics. Folkloristic materials are very challenging for NLP because of their specific semantic-syntactic structure, inherent dialectal diversity, and strong intertextuality, so a robust NLP method is needed that can account for topical distribution and context and detect general heterogeneity. The focus of this paper is on computational semantic analysis (such as word-sense disambiguation and topic recognition) and its ability to uncover the latent semantic structure of folkloristic corpora.
Similar resources
Software Projects for Developing Digital Humanities Resources
In this short paper we report on experiences gained from bachelor and master theses, and from a series of software projects conducted in cooperation with the Department of Computational Linguistics of Saarland University. These theses and software projects dealt with the application of Natural Language Processing and Semantic Web technologies to the representation and...
Preferred Lexical Access Route in Persian Learners of English: Associative, Semantic or Both
Background: Words in the Mental Lexicon (ML) form semantic fields through associative and/or semantic connections, with a pervasive native-speaker preference for the former. Non-native preferences, however, demand further inquiry. Previous studies have revealed inconsistent Lexical Access (LA) patterns due to limitations in methodology and response categorization. Objectives: To f...
Data repositories in the Humanities and the Semantic Web: modelling, linking, visualising
The paper discusses the inherent potential of the Semantic Web and its related technologies for humanities research. The focus is on the extraction of semantic relations from heterogeneous XML-based scholarly corpora using a web-service-based infrastructure (XTriples). Especially the creation of methodologically distinct semantic corpora stemming from data sets originating in the humanit...
English and Persian Sport Newspaper Headlines: A comparative study of linguistic means
The use of rhetorical figures is common in specialized languages such as the language of newspaper headlines. The present study attempted to conduct a contrastive analysis of the English and Persian sport newspaper headlines related to the 2014 FIFA World Cup. Toward this end, a corpus consisting of 400 English and 400 Persian headlines published from 12 June to 13 July 2014 was c...